Summary

Cu,Zn superoxide dismutase (SOD1) has been implicated in the familial form of the neurodegenerative
disease Amyotrophic Lateral Sclerosis (ALS). It has been suggested that mutant-mediated SOD1
misfolding/aggregation is an integral part of the pathology of ALS. We study the folding thermodynamics
and kinetics of SOD1 using a hybrid molecular dynamics approach. We reproduce the experimentally
observed SOD1 folding thermodynamics and find that the residues which contribute the most to SOD1
thermal stability are also crucial for apparent two-state folding kinetics. Surprisingly, we find that
these residues are located on the surface of the protein and not in the hydrophobic core. Mutations in
some of the identified residues are found in patients with the disease. We argue that the identified
residues may play an important role in aggregation. To further characterize the folding of SOD1, we
study the role of cysteine residues in folding and find that non-native disulfide bond formation may
significantly alter SOD1 folding dynamics and aggregation propensity.

Keywords: Cu,Zn superoxide dismutase, Familial Amyotrophic Lateral Sclerosis, molecular dynamics simulations, 
aggregation, misfolding. 
I. INTRODUCTION

ALS is the most common motor neuron disease in human adults and is characterized by selective motor
neuron death. In approximately 10% of ALS cases, the disease is inherited and is called Familial ALS
(FALS). Mutations in the cytoplasmic enzyme SOD1 were identified as the primary cause of approximately
20% of FALS cases. More than 90 mutations have been identified so far. The FALS mutations are scattered
throughout the primary sequence and three-dimensional structure of SOD1, which exists as a homodimer in
the native state (each monomer is 153 amino acids in length). One hypothesis to explain the toxic
gain-of-function of the mutants is that the toxicity derives from intracellular aggregates and/or failure
of SOD1 degradation. This hypothesis is supported by the observation that, in both mouse and cell culture
models, the death of mutant neurons is preceded by the formation of cytoplasmic aggregates containing
mutant SOD1 [9], [10]. In addition, SOD1 knockout mice do not develop motor neuron disease. Aggregation or
misfolding is therefore a characteristic of SOD1-mediated FALS. Toxicity may arise through (i) aberrant
chemistry due to misfolded mutants [12], [13], (ii) saturation of essential cellular machinery such as
chaperones and proteasome components, or (iii) oligomeric forms of the aggregate, which may themselves be
toxic to the cells, as has been suggested for other neurodegenerative diseases [17].

Aggregation of proteins is mediated by a variety of factors, including native state stability [18], [19],
β-sheet propensity [20], net charge and overall hydrophobicity [21]. In addition, folding and aggregation
are kinetically competing processes [22], [23]. Thus, it is crucial to study the folding thermodynamics
and kinetics of SOD1 in order to understand the causes of SOD1 misfolding, aggregation and toxicity. A
microscopic picture of SOD1 folding and/or aggregation dynamics is not available from either theory or
experiment. We reconstruct the SOD1 folding mechanism from molecular dynamics simulations.

Neither the structure of SOD1 aggregates nor the mechanism of aggregation is fully understood. Several
experimental studies have characterized the folding thermodynamics of SOD1 and the associated FALS
mutants. Lepock et al. and Rodriguez et al. measured the stability of mutants in differential scanning
calorimetry studies of the irreversible unfolding of the distinctly metallated species of the enzyme, and
showed that the mutants are destabilized relative to the wild type. In addition, a crystal structure of
the G37R mutant [31] shows higher atomic displacement parameters (B-factors) for the side-chains compared
to the wild type, while the backbone conformation is not significantly different, indicating greater
molecular flexibility in some portions of the structure. It was recently shown that, for some mutants, the
apo-state (without metals) is markedly destabilized compared to that of the wild type, while the
holo-state (with metals) of both mutants and wild type is not significantly affected [33]. It is,
therefore, possible that the apo-state of SOD1, the subject of our studies, is implicated in misfolding
and aggregation.

Most Cu,Zn superoxide dismutases, including human SOD1, have been shown to undergo irreversible
aggregation on exposure to temperatures higher than their respective melting temperatures. Free
cysteines, those that do not form disulfide bonds in the native state, are known to be involved in this
phenomenon. It is well established that improper disulfide bond formation, concomitant with cysteine
oxidation and possibly enhanced by metals in SOD1, is a cause of heat-denatured aggregation. In the case
of bovine SOD1, substitution of free cysteine residues by site-directed mutagenesis was found to greatly
increase the reversibility of denaturation without substantially affecting the conformational stability
[25]. Yeast SOD1 [34] and E. coli SOD1 [35], neither of which contains free cysteines, show partial and
full reversibility, respectively, upon denaturation. However, SOD1 from P. leiognathi denatures
irreversibly despite having no free cysteines [36]. Clearly, the presence of free cysteines is one of the
crucial factors (but not the only factor) responsible for irreversible denaturation. One plausible reason
for the effect of free cysteines is that the formation of improper disulfide bonds makes SOD1 more prone
to kinetic traps. Therefore, we also study the effect of disulfide bonds on SOD1 folding reversibility.

An atomic-resolution simulation of the folding process using traditional molecular mechanics force fields
is difficult by direct computational approaches because of the vast dimensionality of the protein
conformational space [37]. Simplified models such as the Go model [39] provide a powerful alternative for
studying folding because they can simulate folding on reasonable time scales and reproduce the basic
thermodynamic and kinetic properties of proteins both on-lattice [40], [42] and off-lattice. In the Go
model, the energy of the protein is expressed as a sum of pairwise native contact energies. A native
contact exists between two amino acids if they are closer to each other than a cut-off distance in the
native state. Thus, in the Go approximation, folding can be construed as a transition from a state with
no or a small number of native contacts to a state with all native contacts. Despite their apparent
simplicity, native topology-based approaches have yielded results in agreement with experiments, in
particular for small proteins (< 100 amino acids) [45], reflecting the underlying simplicity in the
folding of these proteins. However, a Go model of longer proteins may lead to discrepancies in protein
folding dynamics between simulation and experiment because sequence-specific interactions may
significantly alter the folding dynamics. Thus, we develop a methodology to incorporate sequence-specific
information on the relative contributions of various amino acids into the Go model of SOD1.
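
The Go approximation described above can be illustrated with a short sketch. This is not the simulation code used in this work: it assumes one bead per residue, uses the 7.5 Å cut-off quoted in Methods, counts only the attractive native terms (hard-core and non-native repulsion are omitted), and the helper names `native_contacts` and `go_energy` as well as the `min_sep` parameter are hypothetical.

```python
import math

def native_contacts(coords, cutoff=7.5, min_sep=2):
    """Return the set of residue pairs (i, j), i < j, closer than `cutoff` (in Å).

    `coords` is a list of (x, y, z) bead positions in the native state.
    `min_sep` (an assumption of this sketch) skips pairs adjacent along the chain.
    """
    contacts = set()
    for i in range(len(coords)):
        for j in range(i + min_sep, len(coords)):
            if math.dist(coords[i], coords[j]) < cutoff:
                contacts.add((i, j))
    return contacts

def go_energy(coords, contacts, cutoff=7.5, u=1.0):
    """Go energy of a conformation: -u for every native contact that is formed."""
    formed = sum(1 for i, j in contacts if math.dist(coords[i], coords[j]) < cutoff)
    return -u * formed

# Toy 4-bead "native" structure (a square with 5 Å sides) and a stretched chain.
native = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (5.0, 5.0, 0.0), (0.0, 5.0, 0.0)]
stretched = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0), (15.0, 0.0, 0.0)]

contacts = native_contacts(native)           # contact list defined by the native state
e_native = go_energy(native, contacts)       # all native contacts formed
e_stretched = go_energy(stretched, contacts) # no native contacts formed
```

In this toy case the native square has three non-adjacent contacts, so its energy is three well depths below that of the stretched chain, which forms none of them.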
II. RESULTS

We perform DMD simulations using the unscaled Go model to determine the thermodynamic properties of a
model of SOD1 (as described in Methods). The temperature dependence of the average potential energy is
shown in Fig. 1a. There is a sigmoidal increase in the potential energy with increasing temperature. At
low temperatures, SOD1 is present mostly in its native state, as evidenced by the small (< 2.83 Å)
root-mean-square deviation, RMSD, from the native state (Fig. 1b) and the small fluctuations in the radius
of gyration, R_g, around its native state value (14.2 Å) (data not shown). At high temperatures, SOD1 is
unfolded, with the RMSD from the native state being greater than 40 Å, demonstrating the loss of any
structural similarity with the native state. Also, R_g is approximately three times its value in the
native state (data not shown), indicating that the average distance between any two amino acids is
approximately three times that in the native state.
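
The link between R_g and the typical inter-residue distance follows from the identity R_g^2 = (1/2N^2) Σ_i Σ_j |r_i − r_j|^2, so the root-mean-square pair distance equals √2 R_g and scales with it. The sketch below checks this numerically on made-up coordinates; the function names are hypothetical and the coordinates are not SOD1 data.

```python
import math

def radius_of_gyration(coords):
    """Root-mean-square distance of the beads from their centroid."""
    n = len(coords)
    center = [sum(c[k] for c in coords) / n for k in range(3)]
    return math.sqrt(sum(math.dist(c, center) ** 2 for c in coords) / n)

def rms_pair_distance(coords):
    """Root-mean-square distance over all ordered bead pairs (i, j)."""
    n = len(coords)
    total = sum(math.dist(a, b) ** 2 for a in coords for b in coords)
    return math.sqrt(total / (n * n))

# Arbitrary compact conformation and the same conformation expanded threefold.
compact = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 1), (2, 1, 1)]
expanded = [tuple(3.0 * x for x in c) for c in compact]

rg_ratio = radius_of_gyration(expanded) / radius_of_gyration(compact)
pair_ratio = rms_pair_distance(expanded) / rms_pair_distance(compact)
```

Both ratios come out as three, which is the sense in which a threefold increase in R_g implies a threefold increase in the (root-mean-square) distance between residues.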

In a typical trajectory (Fig. 2a) at the temperature corresponding to the mid-point of the abrupt change
in the temperature dependence of the potential energy (Fig. 1a), defined as the folding transition
temperature T_F, we observe three distinct states: folded, unfolded and intermediate. A histogram of the
potential energies of protein conformations near T_F is, therefore, tri-modal (Fig. 2c) instead of the
bi-modal distribution expected for a two-state protein, indicating the presence of a folding
intermediate. The detection of the intermediate state is in contrast with the experimental observation of
two-state folding dynamics of the SOD1 monomer [28], [29]. Thus, the unscaled Go model for SOD1 does not
accurately reproduce the folding dynamics of SOD1. Next, we calculate the pairwise interaction energies in
the native state of the SOD1 monomer and then employ the scaled Go model to uncover SOD1 folding
thermodynamics and kinetics.
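
The distinction between a bi-modal and a tri-modal energy histogram can be made operational by counting well-separated peaks. The following is only a sketch of one simple way to do so, not the analysis used for Fig. 2: the function `count_modes` and its `min_fraction` threshold are hypothetical, and real histograms would normally need smoothing before peak counting.

```python
def count_modes(counts, min_fraction=0.2):
    """Count local maxima that reach at least `min_fraction` of the tallest bin."""
    if not counts:
        return 0
    # Collapse plateaus so that a flat-topped peak is counted once.
    collapsed = [counts[0]]
    for c in counts[1:]:
        if c != collapsed[-1]:
            collapsed.append(c)
    threshold = min_fraction * max(collapsed)
    # Pad with -inf so that peaks at either end of the histogram are detected.
    padded = [float("-inf")] + collapsed + [float("-inf")]
    return sum(
        1
        for k in range(1, len(padded) - 1)
        if padded[k] > padded[k - 1]
        and padded[k] > padded[k + 1]
        and padded[k] >= threshold
    )

# Made-up bin counts: folded / intermediate / unfolded versus folded / unfolded.
three_state = [0, 4, 9, 4, 1, 3, 7, 3, 1, 5, 10, 5, 0]
two_state = [0, 5, 10, 5, 1, 1, 1, 6, 12, 6, 0]

n_three = count_modes(three_state)
n_two = count_modes(two_state)
```

The threshold keeps small noise bumps from being counted as states; its value is a free choice of the sketch.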

A few contacts make a large thermodynamic contribution to SOD1 native state stability. We decompose
the stability of SOD1 into pairwise contributions (as described in Methods), which are plotted in the
lower triangle in Fig. 3 (the energy map). In the energy map, some contacts, corresponding to both
short-range and long-range interactions, have significantly stronger CHARMM interaction energies (from
three to five times) than average. We find that there are 16 long-range interactions that have large
(about five times larger than the average) interaction energies: Lys3-Glu21, Lys3-Glu153, Lys30-Glu100,
Ser34-Asp96, Lys36-Asp92, Glu40-Lys91, Glu49-Arg115, Arg69-Glu77, His71-Asp83, Arg79-Asp83, Arg79-Asp101,
His80-Asp83, Asp101-Val103, Glu121-Ser142, Asp125-Lys128, Glu133-Lys136. The residues involved in the
strong long-range contacts are highlighted in Fig. 4 and are involved in contacts between oppositely
charged amino acid pairs on the surface of the protein. Out of the 16 identified interactions, 6 have been
found to be directly disrupted in patients with FALS as a result of point mutations (listed in Table I) in
the residues making these contacts [2]. In most cases, the mutations correspond to the change of a charged
amino acid to an oppositely charged or a neutral one, suggesting that these key contacts play an important
role in SOD1 folding.

Scaling thermodynamically important contacts modulates folding kinetics in the Go model. We
test the importance of the identified amino acids in DMD simulations by scaling their contact strength in
the Go potential. We increase the depth of the interaction well of the identified key contacts to five
times their original value in the unscaled Go model (b = 5). Using this scaled Go model, we perform DMD
simulations to calculate the thermodynamic properties of SOD1. We find that the potential energy, RMSD and
R_g (data not shown) exhibit the same sigmoidal trend as for the unscaled Go model, but the folding
transition temperature increases to approximately 1.1 from its earlier value of 1.0. Remarkably, a typical
trajectory near the folding transition temperature (T_F = 1.15) shows that the intermediate state is no
longer populated and the protein exists only in either the folded or the unfolded state near T_F (Fig. 2b
and d). Thus, by strengthening these key contacts we reproduce the experimentally observed two-state
folding of the SOD1 monomer [28], [29]. As a control, we randomly select several combinations of sixteen
contacts and scale them by the same procedure. In such control simulations, we do not observe such an
effect on the folding kinetics (Fig. 2b), demonstrating that the effect of strengthening the identified
contacts is specific. Importantly, this result uncovers a crucial connection between the folding
thermodynamics and kinetics of SOD1: in our model, the residues that contribute the most to the
thermodynamic stability of the SOD1 native state are also crucial for keeping the folding kinetics
two-state.
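
The scaling step and its random control can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: contacts are labelled by placeholder index pairs, the first sixteen stand in for the key contacts, and the choice to exclude the key contacts from the random control set is an assumption of the sketch (the text says only that sixteen contacts are chosen at random). The function names are hypothetical.

```python
import random

def scale_contacts(strengths, selected, b=5.0):
    """Return a copy of `strengths` with the well depth of `selected` contacts multiplied by b."""
    return {pair: (b * u if pair in selected else u) for pair, u in strengths.items()}

def random_control(strengths, n, exclude, seed=0):
    """Pick n contacts at random, avoiding those in `exclude` (a choice made in this sketch)."""
    rng = random.Random(seed)
    pool = sorted(pair for pair in strengths if pair not in exclude)
    return set(rng.sample(pool, n))

# Toy contact list: 495 equally strong contacts labelled by placeholder index pairs.
unscaled = {(k, k + 10): 1.0 for k in range(495)}
key = {(k, k + 10) for k in range(16)}  # stand-ins for the 16 key contacts

scaled = scale_contacts(unscaled, key)              # key contacts five times deeper
control_set = random_control(unscaled, 16, exclude=key)
control = scale_contacts(unscaled, control_set)     # same total strengthening, random contacts
```

Because the scaled and control models add the same total well depth, any difference in folding behavior between them can be attributed to which contacts are strengthened rather than to how much.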

Improper disulfide bonds cause kinetic trapping of SOD1. Human SOD1 is known to aggregate irreversibly
on heat denaturation [30] in vitro. One of the reasons for the irreversibility is improper disulfide bond
formation between free cysteines. There are four cysteine residues in the SOD1 monomer, at positions 6,
57, 111 and 146, of which residues 57 and 146 form a disulfide linkage in the native state. We study the
effect of non-native disulfide bond formation by starting from a fully unfolded conformation, generated at
T = 2.3 (≫ T_F), and slowly cooling the system to a low temperature T = 0.4 (≪ T_F), while allowing
interactions between any of the four cysteines to form improper disulfide bond(s). We model disulfide
bonds by an attractive potential (b = 2, γ = −1 in Eq. (2)). As a control, we repeat the annealing
procedure without these non-native disulfide bonds (b = 1, γ = 1 in Eq. (2)). We observe that out of 20
trajectories with improper disulfide interactions, 4 result in SOD1 structures that are trapped in
non-native conformations, while all of the 20 control trajectories fold to the correct native structure on
the same time scale. This implies that the occurrence of improper disulfide linkages increases the number
of kinetic traps for the SOD1 chain en route to its native state. The above observation is statistically
significant (the probability of observing this result by chance: p-value ≈ 0.01). The increased number of
kinetic traps makes intermediate states more populated, and interaction of these intermediate states may
make SOD1 prone to aggregation. We conclude that free cysteine residues play an important part in the
kinetics of folding and, possibly, aggregation.
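
The text does not state how the quoted p-value was computed. One reading that gives a value of the same order, offered here purely as an assumption, is to ask how likely it is that all 20 control trajectories fold if each had the trapping probability observed with improper disulfide interactions (4/20). The function name below is hypothetical.

```python
def prob_no_traps(p_trap, n_trajectories):
    """Probability that none of n independent trajectories is trapped."""
    return (1.0 - p_trap) ** n_trajectories

# Trapping frequency observed with improper disulfide interactions: 4 of 20.
p_trap = 4 / 20

# Chance of 20 trap-free control runs if the control shared that trapping frequency.
p_value = prob_no_traps(p_trap, 20)
```

Under this assumption the result is about 0.012, comparable to the value quoted above; other tests on the same counts (for example, an exact test on the 2×2 table) need not give the same number.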

III. DISCUSSION 

Experimental evidence suggests that the unfolding of dimeric superoxide dismutase occurs in a three-state
equilibrium:

$$ D \rightleftharpoons 2M \rightleftharpoons U \tag{1} $$

where D, M and U are the SOD1 dimer, monomer and unfolded state, respectively. In order to understand the
effect of mutations on SOD1 folding, it is important to study the effect of mutation on both processes in
Eq. (1). Here, we study the second process (M ⇌ U) and report thermodynamically and kinetically important
residues for SOD1 monomer folding.

Using our hybrid method, which incorporates molecular mechanics energetics into the Go model, we identify
the key residues involved in the folding of the protein. Surprisingly, the key residues are found on the
surface of the protein and not in its hydrophobic core (Fig. 4). By comparing the identified residues with
a comprehensive list of mutations associated with FALS [3], we find that 6 out of the 16 contacts
identified in our simulations are implicated in the disease. In Table I, we list the FALS mutations that
disrupt contacts identified from our simulations. We find that these mutations replace a charged residue
in the wild-type protein with a neutral or oppositely charged residue in the mutant, thereby disrupting
the electrostatic interaction. The presence of mutations disrupting these contacts further underscores the
importance of these key contacts and indicates that the disruption of even one of them may potentially
result in protein misfolding.

There have been no experimental studies characterizing SOD1 monomer folding kinetics. However, other
members of the immunoglobulin-like (Ig-like) superfold family [47], to which SOD1 belongs, are
experimentally well characterized. Clarke and colleagues have established that the folding pathways of
Ig-like proteins share some common features [48]. They observe that for five structurally similar Ig-like
proteins, the folding rates are correlated with thermal stabilities, suggesting a similar folding pathway
for all members of the family. Thus, for Ig-like proteins, the kinetic importance of residues in folding
(φ-values) is largely determined by the topology of the fold. We, therefore, expect that the residues
important for SOD1 folding are in positions structurally equivalent to those in other members of the
Ig-like superfold family whose folding kinetics is experimentally well studied [48]. We choose Tnf3, the
fibronectin type III domain of human tenascin, for which, based on φ-values, eight residues in the
hydrophobic core were identified as part of the folding nucleus. We identify residues in SOD1 that are
structurally equivalent to the nucleus residues in Tnf3 by performing a structural alignment of the two
proteins using DALI [51] and list these residues in Table II. We discover that two of the residues
corresponding to this putative nucleus coincide with the strongly interacting key residues identified by
our simulations, while three others are nearest neighbors of other residues we identify. This is
significant because (i) mutations in the nucleus residues, being in proximity to the identified key
residues, may disrupt the network of interactions of key residues and affect folding kinetics, and (ii) in
the Go model entropic contributions are included as effective energetic interactions. Therefore, the
identified key residues may be kinetically crucial because of the entropic contribution of the
corresponding region of the structure (which is reflected by the high φ-values of the neighboring nucleus
residues). Thus, kinetic evidence further supports our hypothesis that residues important for SOD1 native
state thermodynamics are also important for folding kinetics.

Richardson and Richardson have pointed out [52] that the interactions between the edges of β-strands are
crucial in regulating protein aggregation propensity. Naturally occurring β-sheet proteins prevent
aggregation by protecting the edges of β-strands through a multitude of fold-specific mechanisms. In
particular, for Ig-like β-sandwiches, strategically placed side-chain charges at the edges and
loop-crossing are the dominant mechanisms that shield the edges of the sheets. A recently solved crystal
structure of a mutant apo-SOD1 supramolecular assembly [53] identified three regions of the SOD1 structure
where a non-native gain of interaction was observed, leading to an amyloid-like arrangement of SOD1 in
crystals: the cleft between strands S5 and S6 (residues 83-100), the zinc loop (residues 65-78) and the
electrostatic loop (residues 125-142). It was noted that disorder in the zinc and electrostatic loops
promoted non-native interactions with the S5-S6 cleft, which is the interface between the two β-sheets of
SOD1 and hence a likely site for edge-to-edge aggregation [53]. We find that most of the key contacts for
proper folding are made by residues in the three regions identified above. In particular, we see that
contacts on both the edge strands S5 and S6 play a crucial role in folding kinetics. The contacts made by
residues Asp92, Asp96 and Lys91, which form the turn in the S5-S6 hairpin, are crucial to seal the
interface between the two β-sheets and likely prevent the gain of non-native interactions, as shown in
Fig. 5. Thus, our simulations constitute another piece in the growing body of evidence that implicates
edge protection as the dominant mechanism by which natural proteins avoid aggregation. We postulate that
the disruption of key interactions is a plausible scenario for mutant-mediated aggregation of SOD1. The
reorganization of the S5-S6 edge may be the dominant feature that distinguishes the aggregation-prone
misfolded state of SOD1 from the native state. The disruption of charge-charge interactions of the
side-chains of the identified key residues, which stabilize the β-barrel architecture of SOD1, might
expose the main chain at the edges of β-strands to interactions with other structures, thus leading to
aggregation. Wang and Hecht have shown that strategically designed mutations in β-strand overhangs
disfavor aggregation of normally aggregating proteins, further supporting our hypothesis.

We show that the presence of oxidizable cysteines enhances kinetic trapping during folding. In the
presence of metals, cysteine oxidation is enhanced, suggesting a plausible cause for the misfolding of
SOD1. However, it has recently been suggested [32] that misfolding of SOD1 likely takes place in the
apo-enzyme state, when the metals have not yet been delivered to SOD1. In the reducing environment of the
cell, cysteine-mediated misfolding/aggregation of the apo-enzyme is, therefore, not a likely mechanism of
SOD1 misfolding/aggregation. However, our observations do suggest a generic mechanism for SOD1 misfolding,
especially for mutations that do not affect the native state conformational stability [30]. In a scenario
similar to the one we have presented for free cysteines, other mutations may affect the folding kinetics
of SOD1 by introducing non-native interactions along the folding pathway, leading to kinetic trapping
without substantially affecting the native state stability.



IV. METHODS 

The unscaled Go model. We use the Discrete Molecular Dynamics (DMD) algorithm [55], [56] to study the
folding thermodynamics of SOD1. DMD has recently been used to study the folding kinetics [45], [46], [58]
and aggregation of proteins [20], [59]. We model the SOD1 chain by the beads-on-a-string model developed
by Ding et al., with beads corresponding to all Cα and Cβ atoms and constraints between neighboring beads
to mimic real protein flexibility. We study the folding of one monomer derived from the crystal structure
of SOD1 (Protein Data Bank accession code: 1SPD). We use the Go potential to model the interaction energy,
V_ij, between Cβ atoms (Cα for Gly) i and j of SOD1:

$$
V_{ij} =
\begin{cases}
+\infty, & |r_i - r_j| < r_0 \\
\gamma u_{ij}, & r_0 < |r_i - r_j| < r_1 \\
0, & |r_i - r_j| > r_1
\end{cases}
\tag{2}
$$

where |r_i − r_j| is the distance between atoms i and j. The parameters r_0 and r_1 are the hard-core
diameter (set to 3.25 Å) and the cut-off distance (7.5 Å) between Cβ atoms (Cα for glycine) in the native
state, respectively. We assign an attractive interaction to native contacts (γ = −1), defined to exist
between pairs of residues whose Cβ atoms (Cα for glycine) are closer than 7.5 Å in the native state, and a
repulsive interaction (γ = 1) to non-native contacts, defined to exist between Cβ atoms (Cα for glycine)
that are farther than 7.5 Å apart in the native state.
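
The square-well form of Eq. (2) can be written as a small function. This is a minimal sketch, not the DMD implementation: the function name `go_pair_energy` is hypothetical, the well depth u is set to 1 in reduced units, the scale factor b is included as an optional multiplier on the well depth, and the treatment of distances exactly equal to r_0 or r_1 is a choice of the sketch, since Eq. (2) leaves those boundary points unspecified.

```python
import math

R0 = 3.25  # hard-core diameter in Å (value from the text)
R1 = 7.5   # cut-off distance in Å (value from the text)

def go_pair_energy(distance, is_native, u=1.0, b=1.0):
    """Square-well pair energy following Eq. (2).

    `is_native` selects gamma = -1 (attractive) or gamma = +1 (repulsive).
    `b` scales the well depth; b = 1 corresponds to the unscaled case.
    """
    if distance < R0:
        return math.inf            # hard-core overlap is forbidden
    if distance < R1:
        gamma = -1.0 if is_native else 1.0
        return gamma * b * u       # inside the square well (or square shoulder)
    return 0.0                     # beyond the cut-off the beads do not interact

e_native = go_pair_energy(5.0, is_native=True)            # attractive native contact
e_nonnative = go_pair_energy(5.0, is_native=False)        # repulsive non-native contact
e_scaled = go_pair_energy(5.0, is_native=True, b=5.0)     # strengthened native contact
```

A discontinuous potential of this kind is what makes event-driven (discrete) molecular dynamics applicable, since the energy changes only when a pair crosses r_0 or r_1.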

The depth of the attractive square well u_ij in Eq. (2) determines the strength of the attractive
interaction between two amino acids. First, we consider the case where all contacts are equally strong,
i.e. the depth of each attractive well is u. We call this case the unscaled Go model and use it to perform
DMD simulations of SOD1. We find that our model undergoes a collapse transition to its native state
through a meta-stable intermediate (see Results), in contrast with the experimental observation of
two-state folding dynamics of the SOD1 monomer [28], [29]. A possible reason for the existence of
meta-stable intermediates is that, in the unscaled Go model, all native contacts have equal strength.
While such an approximation holds for smaller proteins (such as Src SH3) with a relatively low number of
contacts (~ 162), the SOD1 monomer has 495 native contacts, and the assumption of equal contact strength
overestimates the population of intermediate species. Thus, in order to include the sequence specificity
of native interactions, we increase the strength of a few specific native contacts according to an
all-atom model of SOD1, described below, by making the depth of the attractive well of the chosen native
contacts b u_ij, where b is the scale factor. We call this the scaled Go model. By scaling the Go
potential according to an all-atom model of the protein, we effectively include the sequence information.
The scaled Go model. To determine the relative strength of each individual native contact in SOD1, we
use an effective energy function to decompose the total energy of the protein as a sum of pairwise terms
[60]:

$$ E = \sum_{i} \sum_{j>i} E_{ij} \tag{3} $$

where the summation is taken over all pairs of residues of SOD1 and E_ij is the energy of interaction
between residues i and j. To determine the terms E_ij, we perform simulations with the molecular mechanics
force field CHARMM [61] to construct an all-atom model of the SOD1 monomer in the native state, starting
from its crystal structure (PDB accession code: 1SPD). Recently, higher resolution crystal structures of
apo-SOD1 have appeared in the literature [62], but our results are not qualitatively affected by starting
with the higher resolution structure. We add hydrogen atoms and remove any steric clashes occurring in the
structure by energy minimization. The interactions that contribute the most to the relative contact
interaction strength are the non-bonded interactions and the solvent-induced interaction screening [60]:

$$ E_{ij} = E_{ij}^{\text{non-bonded}} + G_{ij} \tag{4} $$

We compute E_ij^non-bonded as the sum of the Lennard-Jones and electrostatic interaction energies. The
solvation free energy, G_ij, measures the screening of the pairwise interaction due to polarization of the
solvent.

We calculate G_ij using an implicit description of the solvent [63]. We approximate the solvent as a
continuous dielectric medium with the dielectric constant of water (ε = 80) and the protein as a
continuous medium with the dielectric constant of vacuum (ε = 1). We compute the solvation energy by
solving the Poisson-Boltzmann equation using the finite difference method [64] available in CHARMM.
Following Dominy et al. [65], in order to decompose the solvation energy into pairwise contributions, we
eliminate all charges in the protein except those corresponding to residue i and calculate its "self"
solvation energy U_i. Similarly, we compute U_j, and U_ij when the charges corresponding to both i and j
are present. Then, the solvent-mediated screening of the interaction is:

$$ G_{ij} = U_{ij} - \left[ U_i + U_j \right] \tag{5} $$

In this way, we calculate the effective pairwise energy contribution of a contact in the folded state.
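
The bookkeeping behind Eqs. (4) and (5) is simple enough to sketch directly. The functions below only combine already-computed energies; they do not solve the Poisson-Boltzmann equation or call CHARMM, the function names are hypothetical, and the numbers in the example are made up for illustration rather than taken from the simulations.

```python
def screening_term(u_ij, u_i, u_j):
    """Eq. (5): G_ij = U_ij - (U_i + U_j)."""
    return u_ij - (u_i + u_j)

def pair_energy(e_nonbonded, u_ij, u_i, u_j):
    """Eq. (4): E_ij = E_ij^non-bonded + G_ij."""
    return e_nonbonded + screening_term(u_ij, u_i, u_j)

# Illustrative values only (kcal/mol): a favorable direct interaction that is
# partly offset because the pair is solvated less well than the two residues alone.
g_ij = screening_term(u_ij=-110.0, u_i=-80.0, u_j=-75.0)
e_ij = pair_energy(e_nonbonded=-60.0, u_ij=-110.0, u_i=-80.0, u_j=-75.0)
```

With these made-up inputs the screening term is positive and cancels most, but not all, of the direct attraction, which is the qualitative behavior expected for a solvent-exposed pair of opposite charges.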

The values that we thus obtain correspond to the folded state of the protein. In order to calculate the
pairwise contribution of residues to the stability, we subtract the value of the contact energy in the
unfolded state from the E_ij values obtained above. The unfolded state is an ensemble of conformations. It
has been noted that there is considerable native-like structure in the unfolded state [66], [67], [68],
meaning that the average environment of a contact is likely the same as in the folded state, but the
contact is formed less frequently. Therefore, we model the unfolded state by simulating the unscaled Go
model near T_F and estimating the frequency of native contact formation. The total contribution of a given
contact to the stability can be written as:

$$ \Delta E_{ij} = E_{ij} \left( 1 - P_{ij} \right) \tag{6} $$

where E_ij is the contact energy in the folded state and P_ij is the probability of forming the contact in
the unfolded state. The energy and contact maps are plotted in Fig. 3.
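
Eq. (6) can be sketched together with a frequency estimate of P_ij. This is an illustration under assumptions, not the analysis code: `frames` is taken to be a list of conformations already classified as unfolded, a contact counts as formed when the two beads are within the 7.5 Å cut-off, and the function names and toy coordinates are hypothetical.

```python
import math

def contact_probability(frames, i, j, cutoff=7.5):
    """Fraction of frames in which beads i and j are closer than `cutoff` (Å)."""
    formed = sum(1 for coords in frames if math.dist(coords[i], coords[j]) < cutoff)
    return formed / len(frames)

def stability_contribution(e_ij, p_ij):
    """Eq. (6): Delta E_ij = E_ij * (1 - P_ij)."""
    return e_ij * (1.0 - p_ij)

# Four toy frames of a two-bead system; the contact is formed in one of them.
frames = [
    [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (9.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (12.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (20.0, 0.0, 0.0)],
]

p_ij = contact_probability(frames, 0, 1)
delta_e = stability_contribution(-15.0, p_ij)  # -15.0 is an illustrative E_ij in kcal/mol
```

A contact that is already formed most of the time in the unfolded ensemble (P_ij close to 1) contributes little to stability, however strong it is in the folded state.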



V. CONCLUSIONS 



We present a thermodynamic and kinetic analysis of SOD1 folding using a hybrid molecular dynamics method.
We show that our simple model qualitatively reproduces the folding of SOD1 observed in experiments. We
find that interactions at the edges of the SOD1 β-sheets and in the zinc, electrostatic and cross-over
loops are the key interactions that modulate the folding of SOD1, suggesting that mutations destabilizing
these interactions may make the enzyme more prone to aggregation. In particular, we identify the S5-S6
cleft (residues 83-100) as a likely site for reorganization to yield an aggregation-prone misfolded state,
an observation consistent with recent experimental evidence [53] and the negative design paradigm [52]. We
also provide a plausible explanation for the phenomenon of increased reversibility of folding upon
mutation of free cysteine residues. We find good agreement between the positions identified by us to be
crucial for folding kinetics and those identified by experiments [50] on other members of the superfold
family (Ig-like proteins) to which SOD1 belongs. Further, we find that mutations at six of the residue
contacts we identify are implicated in FALS.
TABLE I: Mutations in the identified kinetically important residues that are also found in patients with
FALS. The first column shows 6 of the 16 contacts that we identify. The second column lists mutations that
disrupt these contacts and are found in FALS patients.

Contact          Mutation(s)
Lys3-Glu21       Glu21Lys, Glu21Gly
Glu49-Arg115     Glu49Lys
Asp96-Ser34      Asp96Asn
Lys30-Glu100     Glu100Lys, Glu100Gly
Arg79-Asp101     Asp101Gly, Asp101Asn
Glu49-Arg115     Arg115Gly
TABLE II: Comparison of kinetically important residues. Kinetically important residues in SOD1 were
determined (i) by structural comparison with the protein 1TEN [50] using DALI [51] and (ii) by our
simulations. Nucleus represents high φ-value residues in 1TEN; equivalent positions refer to structurally
equivalent positions determined by the DALI alignment; key residues in the vicinity of equivalent
positions identified by our simulations are shown in column 3. N/A means that no equivalent residue was
found.

Nucleus (1TEN)   Equivalent Position (1SPD)   Key Residue
Ile20            Phe31                        Lys30
Tyr36            His48                        Glu49
Ile48            Asp83                        Asp83
Leu50            Asn86                        N/A
Ile59            Asp101                       Asp101
Thr66            His110                       N/A
Tyr68            Ile112                       N/A
Val70            Gly114                       Arg115
FIG. 1: SOD1 thermodynamic properties. Dependence on temperature of (a) average energy and (b) RMSD in DMD
simulations with the unscaled Go potential. The sigmoidal curves in both (a) and (b) show that SOD1
undergoes a collapse transition. Similar curves are obtained for DMD simulations with the scaled Go
potential.
FIG. 2: Typical DMD trajectories near T_F for (a) unscaled and (b) scaled Go models. One trajectory is
shown out of 4 for the unscaled Go model and 5 for the scaled Go model. In (b), a control trajectory with
randomly chosen contacts strengthened (see text) is shown in grey. Histograms of energy values near T_F
for (c) unscaled and (d) scaled Go models, averaged over 4 and 5 trajectories respectively, are also
shown. From (b) and (d) we observe that the intermediate state is not populated in the scaled Go model but
is present in the unscaled Go model, indicating that the kinetics of the scaled Go model is indeed
two-state, as observed in experiments [28], [29].
FIG. 3: Contact map (upper triangle) and CHARMM-generated energy map (lower triangle) of the SOD1 monomer.
The color shading in the energy map is assigned according to the interaction energy values; red indicates
repulsive interactions and blue indicates attractive interactions. Energy values are in kcal/mol.
FIG. 4: Two projections (a) and (b) of the SOD1 monomer structure highlighting residues making the
identified contacts important for the two-state folding of SOD1: Lys3-Glu21, Lys3-Glu153, Lys30-Glu100,
Ser34-Asp96, Lys36-Asp92, Glu40-Lys91, Glu49-Arg115, Arg69-Glu77, His71-Asp83, Arg79-Asp83, Arg79-Asp101,
His80-Asp83, Asp101-Val103, Glu121-Ser142, Asp125-Lys128, Glu133-Lys136. Figure generated using Kraulis'
Molscript program and coordinates from PDB structure 1SPD.
FIG. 5: S5-S6 cleft. The residues in the cleft that are identified as crucial for making the folding
kinetics two-state in our model are highlighted. This cleft represents one of the edges of the β-barrel
and is a possible site for the initiation of non-native contact formation leading to aggregation, as also
found in Ref. [53].



